Evolution of Reinforcement Learning in Uncertain Environments: Emergence of Risk-Aversion and Matching

نویسندگان

  • Yael Niv
  • Daphna Joel
  • Isaac Meilijson
  • Eytan Ruppin
چکیده

Reinforcement learning (RL) is a fundamental process by which organisms learn to achieve a goal from interactions with the environment. Using Artificial Life techniques we derive (near-)optimal neuronal learning rules in a simple neural network model of decision-making in simulated bumblebees foraging for nectar. The resulting networks exhibit efficient RL, allowing the bees to respond rapidly to changes in reward contingencies. The evolved synaptic plasticity dynamics give rise to varying exploration/exploitation levels from which emerge the welldocumented foraging strategies of risk aversion and probability matching. These are shown to be a direct result of optimal RL, providing a biologically founded, parsimonious and novel explanation for these behaviors. Our results are corroborated by a rigorous mathematical analysis and by experiments in mobile robots.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Foraging Behaviors Evolution of Reinforcement Learning in Uncertain Environments: A Simple Explanation for Complex

1 Introduction Reinforcement learning (RL) is a process by which organisms learn from their interactions with the environment to achieve a goal (Sutton & Barto, 1998). In RL, learning is contingent upon a scalar reinforcement signal that provides evaluative information about how good an action is in a certain situation, without providing an instructive supervising cue as to which would be the p...

متن کامل

Overcoming Learning Aversion in Evaluating and Managing Uncertain Risks.

Decision biases can distort cost-benefit evaluations of uncertain risks, leading to risk management policy decisions with predictably high retrospective regret. We argue that well-documented decision biases encourage learning aversion, or predictably suboptimal learning and premature decision making in the face of high uncertainty about the costs, risks, and benefits of proposed changes. Biases...

متن کامل

Dynamic Obstacle Avoidance by Distributed Algorithm based on Reinforcement Learning (RESEARCH NOTE)

In this paper we focus on the application of reinforcement learning to obstacle avoidance in dynamic Environments in wireless sensor networks. A distributed algorithm based on reinforcement learning is developed for sensor networks to guide mobile robot through the dynamic obstacles. The sensor network models the danger of the area under coverage as obstacles, and has the property of adoption o...

متن کامل

Does Learning Elicit Neuromodulation? Evolutionary Search in Reinforcement Learning-like Environments

Although the importance of neuromodulation in neural substrates has been widely recognised, the computational role, characteristics and advantages of such models in Artificial Neural Networks are mostly unknown. To investigate this issue, here the autonomous emergence of neuromodulatory structures is considered by means of artificial evolution in reinforcement learning-like environments. By giv...

متن کامل

ON THE MATCHING NUMBER OF AN UNCERTAIN GRAPH

Uncertain graphs are employed to describe graph models with indeterministicinformation that produced by human beings. This paper aims to study themaximum matching problem in uncertain graphs.The number of edges of a maximum matching in a graph is called matching numberof the graph. Due to the existence of uncertain edges, the matching number of an uncertain graph is essentially an uncertain var...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2001